A veto for continuous gravitational wave searches 



Llucia Sancho de la Jordanaj and Alicia M. Sintes|| 

t Dopartamcnt dc Fi'sica, Univorsitat dc los lUos Balears, Cra. Valldomossa Km. 
7.5, E-07122 Palma de Mallorca, Spain 

X Max-Planck-Institut fiir Gravitationsphysik, Albert Einstein Institut, Am 

Miihlenberg 1, D-14476 Golm, Germany 

E-mail: llucia.saiicho@uib.es, sintesOaei .mpg.de 

Abstract. 

vetoes are commonly used in searching for gravitational waves, in 
particular for broad-band signals, but they can also be applied to narrow-band 
continuous wave signals, such as those expected from rapidly rotating neutron 
stars. In this paper we present a veto adapted to the Hough transform searches 
for continuous gravitational wave signals; we characterize the x'^-significance plane 
for different frequency bands; and discuss the expected performance of this veto 
in LIGO analysis. 



PACS numbers: 04.80.Nn, 07.05.Kf, 95.55.Ym, 97.60. Gb 
1. Introduction 

Continuous gravitational wave signals emitted by neutron stars in our galaxy are 
among the targets of on-going searches for gravitational waves using GEO600 [H [2] , 
LIGO [21 H] and VIRGO [5] data. Examples of such searches include targeting 
known radio pulsars O [71 [8] and the low-mass X-ray binary system Scorpius X- 
1 ini [in] ; as well as all-sky surveys for unknown rotating neutron stars [HI El [12] ■ 
The first type of searches typically use matched filtering techniques and are not very 
computationally expensive. The second type of searches look for as yet undiscovered 
sources. This involves searching over large parameter space volumes and turns out to 
be computationally limited, as the number of templates that must be searched over 
increase rapidly with the observation time. The ultimate goal for wide parameter 
searches for continuous signals over large data sets is to employ hierarchical schemes 
which alternate coherent and semi-coherent techniques [131 [Ml [151 [TBI [171 [18] , as those 
currently employed by Einstein@Home [19j . a distributed-computing effort that uses 
the idle CPU time of computers across the world. 

The Hough transform [5D1 [ST] is an example of a semi-coherent method that can 
be used to select candidates in parameter space to be followed up. Results of the 
Hough transform to search the entire sky have been reported in [TTl [12] . In those 
papers, the Hough transform was used to search for cumulative excess power from 
a hypothetical periodic signal by examining successive spectral estimates based on 
short Fourier transforms (SFTs) of the calibrated detector data, taking into account 
the Doppler frequency shift due to the motion of the detector with respect to the solar 
system barycenter and the intrinsic frequency evolution of the source. Two flavors 
of the Hough transform have been developed and employed for different searches, 



A veto for continuous gravitational wave searches 



2 



the 'standard Hough' [IHl E] and the 'weighted Hough' [UII^. In the 'standard 
Hough', the cumulative excess power is computed as the sum of binary zeroes and 
ones, where a SFT contributes unity if the power exceeds a normahzed threshold, and 
in the 'weighted Hough' the contribution of the SFTs is weighted according to the 
noise and detector antenna pattern to maximize the signal-to-noise (SNR) ratio. 

In all hierarchical methods it is crucial that the selection of candidates done by the 
semi-coherent stage is as effective as possible, since it determines the final sensitivity 
of the full pipeline. For this reason the development of veto and/or coincidence tests is 
very important in order to reduce the number of false alarms. In this paper we present 
a veto adapted to the Hough transform searches for continuous wave signals and 
we discuss the expected performance of this veto in LIGO analysis. 

discriminators are commonly used in searching for gravitational wave signals. 
In particular, for binary inspiral searches a time-frequency discriminator is used as 
a veto for the output of matched filter, by analyzing the output of different frequency 
bands. This test [551 112 was specifically constructed for broadband signals, but 
it can be modified for signals that are narrow band as the continuous wave signals 
expected from rapidly rotating neutron stars. For these continuous wave signals that 
will be observed over periods of several months, in order to build up the sufficient 
SNR, we propose to split the data into several chunks, analyze each chunk separately 
and construct a statistic by combining the partial results. This statistic would then 
be able to discriminate if the SNR accumulates along the different chunks in a way 
that is consistent with the properties of the signal and the detector noise. 

The rest of the paper is organized as follows: Section [2] briefly summarizes the 
Hough transform and its statistical properties. Section [3] derives a x^ discriminator 
for different implementations of the Hough transform. Section 3] characterizes the x^- 
significance plane and discusses its application using LIGO data. Section [5] concludes 
with a summary of the results. 

2. The Hough transform method 

The Hough transform is a well known method for pattern recognition that has been 
applied to the search for continuous gravitational waves. In this case the Hough 
transform is used to find a signal whose frequency evolution fits the pattern produced 
by the Doppler shift and the spin-down in the time-frequency plane of the data. 
Further details can be found in [20j , here we only give a brief summary and statistical 
properties. 

The starting point for the Hough transform are N SFTs. Each of these SFTs is 
digitized by setting a threshold pth on the normalized power 



Here Xk is the discrete Fourier transform of the data, the frequency index k corresponds 
to a physical frequency of = k/T^^t,, Sn{fk) is the single sided power spectral density 
of the detector noise and T^oi, is the time baseline of the SFT. The k*^ frequency bin 
is selected if pk > Pth, and rejected otherwise. In this way, each SFT is replaced by a 
collection of zeros and ones called a peak-gram. The probability that a frequency bin 
is selected is q = e~''^^ for Gaussian noise and 77, given by 




(1) 



Pk = 



T^ahSnifk) 




(2) 
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in the presence of a signal. A^, is the signal to noise ratio within a single SFT, and for 
the case when there is no mismatch between the signal and the template: 

_ i\hih)\' 

with h(f) being the Fourier transform of the signal h{t). 

The Hough transform is used to map points from the time-frequency plane of our 
data (understood as a sequence of peak-grams) into the space of the source parameters. 
Each point in parameter space corresponds to a pattern in the time-frequency plane, 
and the Hough number count n is the weighted sum of the ones and zeros of the 
different peak-grams along this curve. For the 'weighted Hough' this sum is computed 
as 

N 

n ^ WiUi , (4) 
i=i 

where is either or 1 depending on where the power crosses the threshold and the 
weights are normalized according to 

N 

= iV . (5) 

1=1 

When all = 1 we obtain the 'standard Hough'. The 'weighted Hough' can improve 
the sensitivity of the search taking into account the possible non-stationarities of the 
detector noise and the amplitude modulation due to the motion of the detector, and 
allow for multi-interferometer searches. 

For large values of N, the number count distribution n can be considered a 
continuous variable and well approximated by a Gaussian distribution: 

In the absence of a signal, the mean and variance are 

N 

{n)^Nq and ^ q{l - q)^,^^ (7) 

i=l 

and in the presence of a signal, the mean and variance of n become 

N N 

(n) ^ qN + ^ WiXi and = E ^i^^i^ ~ ■ (8) 

4=1 4=1 

Because of the weights a varies for different sky locations we should not compare 
number counts directly but the significance of a number count. The significance s of 
the observed number-counts n is defined as 

s=^^, (9) 

(T 

where (n) and a are the expected mean and standard deviation for pure noise. 
Furthermore, one can see that setting a threshold at a given false alarm rate a is 
equivalent to set a threshold at a certain significance [H] 



y2erfc-^(2a). (10) 
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3. The veto 

In this section wc derive a discriminator for the different implementations of the 
Hough transform. The idea is to spht the data into p non-overlapping chunks, each of 
them containing a certain number of SFTs {Ni, N2, . . . , Np}, such that 
p 

^7V,=iV, (11) 

and analyze them separately, obtaining the Hough number-count nj which, for the 
same pattern across the different chunks, would then satisfy 
p 

Y,n,=n, (12) 
i=i 

where n is the total number-count for a given point in parameter space. The 
statistic will look along the different chunks to see if the SNR accumulates in a way 
that is consistent with the properties of the signal and the detector noise. Small values 
of x^ are consistent with the hypothesis that the observed SNR (or more precisely the 
significance) arose from a detector output which was a linear combination of Gaussian 
noise and the continuous wave signal. Large values of x^ indicate either the signal did 
not match the template or that the detector noise was non-Gaussian. 



3.1. The standard Hough 

In the simplest case in which all weights are set to unity, it is easy to see that the 
expected values of the number counts are 

(n)-iV,;, nl^Nr^{l~T^), (13) 

(n) 

where n is the total measured number count and 77 is the probability of selecting a 
peak in the presence of a signal. 

Consider the p quantities defined by 

Anj=nj-^n. (15) 
With this definition, it holds true that 

P AT. 

(An,)=0, EAn,=0 ,{n^n) = ^{n^), (16) 
i=i 

and the expectation value of the square of Auj is 

N. ' 

Therefore we can define the x^ discriminator statistic by 

._f^(An,)^ ^ {n,-nN,/Nf 



((An,)2)= l--f iV,,y(l-,?). (17) 
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and expected value of is 

(x^)-E(l-f)=^-l- (19) 

Equation (fT5|) has a ^^-distribution with p — I degrees of freedom. To implement 
this discriminator, we need to measure, for each point in parameter space, the 
total number-count n, the partial number-counts Uj and assume a constant value 
of 77 = n/N. 



3.2. The weighted Hough 

We can generalize the previous result for the weighted Hough transform. Let Ij be 
the set of SFT indices for each different p, the mean and variance of the number-count 
become 

p 

(n,) = ^ w,;77, (n) = ^(n,) al^ = ^ ^2^,(1 - 77,) (20) 

ie/j j=l iG/j 

and we can define 

Auj = - n— iv , (21) 

so that (Ajij) = 0, ^"-j = 0- The discriminator would now be: 

X = E = E ^ ^ 2 n ^ ■ 22 

In a given search, we can compute the Eie/j Eie/j fo^' ^s^ch of the p chunks. 
The problem for implementing this discriminator is that the different r]i values 
can not be measured from the data itself because they depend on the exact SNR for 
a single SFT as defined in Eqns. ^ and ^ and not just its averaged value. For this 
reason we propose to approximate in equation (|22[) rji jy*, where 77* = n/N. Thus 

E „*n _ ,,,2 ■ (2^) 



4. Application of the veto on the LIGO fourth science run 

We use as playground the SFT data produced during LIGO's 29.5-day fourth science 
run (S4) and analyze it by means of the "weighted Hough" scheme, combining the data 
of the three LIGO detectors (HI, H2, LI). The SFTs were generated directly from 
the calibrated data stream, using 30-minute intervals of contiguous data for which the 
interferometer is operating in what is known as science-mode, having in the end 1004 
SFTs from HI, 1063 SFTs from H2 and 899 from LI. An all-sky search for periodic 
gravitational waves in the frequency range 50- 1000 Hz and with the frequency's time 
derivative in the range —1 x 10~^-0 Hz s""'^ using the S4 data is reported in |12j . 
In that paper, three different semi-coherent methods of transforming and summing 
strain power from SFT data have been used. In [T^, to identify the most interesting 
subset in parameter space, a fixed threshold of SNR or significance of 7 was applied 
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Figure 1. versus significance obtained for the 90-100 Hz band and 

comparison of the values obtained with a distribution with p — 1 degrees of 
freedom. The top figures correspond to p = 8 and the bottom ones to p = 16. 



for all three searches, and candidates were analyzed by means of a simple coincident 
test. The paper reported no evidence of periodic gravitational radiation. 

To characterize the ^^-significance plane in order to discriminate between 
instrumental noise and real signals, we first study some small frequency bands of 
the S4 data. The main purpose is to verify that in those bands free of large spectral 
disturbances, the measured distribution is consistent with what we would expect in 
the case of Gaussian noise. For this we split the data into p = 8 or p = 16 segments. 
It is reasonable to split the data into segments with a similar relative contribution 
to the total number count. Therefore, we split the SFTs in such a way that the 
sum of weights into each block satisfies ~ ^/P- In figure [1] we show the 

distributions in the 90-100 Hz frequency band. The results agree very well with the 
expected theoretical distribution. This test was carefully done avoiding the 1 Hz comb 
present in the data. 

The next step is to characterize the x^-significance plane in the presence of signals. 
If there was no mismatch between the signals and the templates, and if we could 
compute exactly the value given by Eqn. ([22|) instead of ([23]), we would obtain 
the same distribution as for the Gaussian noise only case. But in a real search, 
templates are placed on a grid and due to the mismatch there is a dependency of the x^ 
values with the significance [23] . For this reason we select 22 frequency bands between 
50 and 1000 Hz free of spectral disturbances and we analyze them by means of Monte- 
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Figure 2. versus significance for software injected pulsar signals. We 

represent the results for four different frequency bands of 0.8 Hz free of spectral 
disturbances (avoiding the 1 Hz comb) using p = 16, together with the best fitted 
quadratic curve of the envelope of the points obtained. 



Carlo software injections. For each of these bands we inject at least 10000 artificial 
signals of different amplitudes, frequencies, inclination angles and sky locations. Wc 
have checked that 10000 signals was enough for our purposes. More extensive analysis 
were performed in the 91-100 Hz band, in which we injected up to 100000 signals 
without observing any considerable impact in the final result. In figure [5] we represent 
the results obtained for four of these different bands. The Monte-Carlo injections have 
been analyzed with no mismatch and also with a small mismatch between the signals 
and the templates, using the same grid separation that was employed in the S4 search 
|12j . It is worth mentioning that this grid was not based on a metric approach that 
would guarantee a maximum given mismatch at any point in parameter space, but 
it was uniformly spaced in frequency and frequency derivative and used an almost 
isotropic grid, but frequency dependent, in the sky. As a consequence of this choice, 
together with the fact that the S4 run was shorter than a month, the mismatch depends 
on the sky location and the frequency. 

For each of the analyzed bands we find a veto curve. The way we proceed is the 
following: we first sort the points with respect to the significance, we group them in 
sets containing equal number of points (typically 50) and we pick the one with the 
largest value (the highlighted stars in Fig. With the set of selected points, we fit 



A veto for continuous gravitational wave searches 



8 




Figure 3. Veto of the violin modes by the using p = 16. In the top panels 
there is the x^- significance plane and the solid line corresponds to the veto curve 
at an average frequency in each case. The bottom panels show the significance 
versus frequency before and after applying the veto. The test successfully 
vetoes the strongest lines related to the violin modes while it fails to veto others, 
e.g. the line at 342 Hz. The dots below the veto curve with small values of 
significance arc consistent with the expected distribution in the case of Gaussian 
noise. 



A 



a curve to the upper contour of the x^-significance planes. From the set of these curves 
we deduce empirically the best parameters of a quadratic curve = + Bs + C 
valid for any frequency between 50 and 1000 Hz and for significance values greater 
than 5. For the S4 data and using p ~ 16, these parameters are: 

-4.229- 10-4/ + 0.1274 50 < / < 300 
300 </ < 1000 

5=^ + 0.7873 

C = 18.666 

We want to point out that this veto curve, in this case, was only valid for significance 
values greater than 5. In a search, one will set a threshold on the significance equal 
or higher than this value (a threshold of 7 was used for selecting triggers in the S4 
search in [12j). and for each trigger one will compare if the value is above or below 
the veto curve. 

Using this veto curve we have analyzed the whole S4 data and here we show 
details of some frequency bands. This discriminator is able to veto all the violin 
modes present in the data and many other narrow instrumental lines, as it is shown 
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Figure 4. Same as figure \3\ for the frequency bands that contain the hardware 
injected Pulsar 3 (left) and Pulsar 8 (right). In the top panels, the dots on top 
of the solid line correspond to the hardware injected signals that are vetoed. 
The dots below the curve, with low values of significance, are consistent with the 
expected distribution in the Gaussian noise case and correspond to the frequencies 
not affected by the presence of the hardware injections. The test is able to 
veto these signals because there were injected in the data in an intermittent way 
and therefore they did not behave like the signals we are looking for. 



in figure [21 In this figure one can see liow tlie strong lines are clearly vetoed while this 
test failed to veto others, e.g. the line at 342 Hz. 

During the S4 run ten artificial pulsar signals were hardware injected in the data 
in an intermittent way. Four of these pulsars Pulsar2, PulsarS, PulsarS and PulsarQ 
were strong enough to be detected by the multi-interferometer Hough search (see |12| 
for further details). Because these signals are not continuously present in the data, the 
test is able to veto them as it is shown in figureH) Of course, this does not happen if 
we analyze only the data segments when the injections took place. In those cases, the 
X^-significance plane is consistent with the one obtained for software injected signals. 
It is worth mention that this test failed to veto all the 60 Hz line harmonics. 

5. Conclusions 

In this paper we have presented a new veto adapted to the Hough transform search 
for continuous gravitational wave signals and discussed the performance of this veto 
using the data from LIGO fourth science run. The implementation of this veto is very 
simple and does not imply a considerable increase in computational cost. We foresee its 
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usage in future searches performed by the LIGO and VIRGO Seientific CoUaboration. 
The veto presented here is adapted to the Hough transform but it could be 
generahzed for other semi-coherent techniques such as PowerFlux or StackShde, and 
also when the Hough transform starts with the maximum detection statistic (known 
as JF-Statistics l9l[20|) rather than SFT power as the input data. 
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